A Fixed-bandwidth View of the Pre-asymptotic Inference for Kernel Smoothing with Time Series Data
نویسندگان
چکیده
This paper develops robust testing procedures for nonparametric kernel methods in the presence of temporal dependence of unknown forms. Based on the xed-bandwidth asymptotic variance and the pre-asymptotic variance, we propose a heteroskedasticity and autocorrelation robust (HAR) variance estimator that achieves double robustness it is asymptotically valid regardless of whether the temporal dependence is present or not, and whether the kernel smoothing bandwidth is held constant or allowed to decay with the sample size. Using the HAR variance estimator, we construct the studentized test statistic and examine its asymptotic properties under both the xed-smoothing and increasing-smoothing asymptotics. The xed-smoothing approximation and the associated convenient t-approximation achieve extra robustness it is asymptotically valid regardless of whether the truncation lag parameter governing the covariance weighting grows at the same rate as or a slower rate than the sample size. Finally, we suggest a simulation-based calibration approach to choose smoothing parameters that optimize testing oriented criteria. Simulation shows that the proposed procedures work very well in nite samples. Keywords: heteroskedasticity and autocorrelation robust variance, calibration, xed-smoothing asymptotics, xed-bandwidth asymptotics, kernel density estimator, local polynomial estimator, t-approximation, testing-optimal smoothing-parameters choice, temporal dependence JEL Classi cation Number : C12, C14, C22 Email: [email protected], [email protected], and [email protected]. For helpful comments, we thank seminar and conference participants at OSU, U of Guelph, CESG, KAEA-KEA, and 2015 ESWC. Kim gratefully acknowledges research support from SSHRC (430-2015-00527). Sun gratefully acknowledges partial research support from NSF under Grant No. SES-1530592. Address correspondence to Yixiao Sun, Department of Economics, UCSD, 9500 Gilman Drive, La Jolla, CA 92093-0508, USA. 1 Introduction This paper proposes new robust testing procedures for nonparametric kernel methods in the presence of temporal dependence of unknown forms. An important issue in hypothesis testing with time series data is how to take nonparametric dependence into account in calculating the standard error. Dependence is typical in time series, and an estimator with positively dependent data tends to have larger variation than that with iid data. Therefore, if temporal dependence is not properly considered, we may have an over-rejection problem. For parametric models this has been a well-researched problem since Newey and West (1987) and Andrews (1991). It is now standard practice to use the heteroskedasticity and autocorrelation robust (HAR) standard error in empirical studies. No such procedure for nonparametric kernel methods has been proposed in the literature. The fact that the distribution of a kernel estimator with dependent data is asymptotically equivalent to the distribution with iid data may have masked the need for more robust nonparametric kernel methods in nite samples. See Robinson (1983) for detail on the asymptotic equivalence. This is in sharp contrast to the parametric case. The asymptotic equivalence implies that the usual standard error formula with iid data is still valid for time series data in an asymptotic sense. However, in nite samples, temporal dependence does a¤ect the sampling distribution of a kernel estimator. In particular, when a process is highly persistent and/or the sample size is not large enough, the asymptotic variance tends to understate the true nite sample variation of a kernel estimator, and this understated variation causes the usual asymptotic test to over-reject in nite samples. See, for example, Conley, Hansen and Liu (1997) and Pritsker (1998), who discuss this problem in kernel density estimation based on short term interest rates. In developing new and more accurate testing procedures, the paper makes several contributions. Firstly, based on the xed-bandwidth asymptotic variance and the pre-asymptotic variance of a kernel estimator, we construct a kernel based HAR variance estimator that captures temporal dependence. The pre-asymptotic approach has also been used in Chen, Liao and Sun (2014) in sieve inference on time series models. Here the pre-asymptoticvariance is further justi ed using the new xed-bandwidth asymptotics where the kernel-smoothing bandwidth h is held xed. The proposed HAR variance estimator achieves double robustness it is asymptotically valid regardless of whether the temporal dependence is present or not, and whether the kernel smoothing bandwidth is held constant or allowed to decay with the sample size. Secondly, we consider the asymptotic properties of the HAR variance estimator and the associated test statistics under various speci cations of the smoothing parameters. There are two smoothing parameters in our testing procedures. The rst is the kernel smoothing bandwidth parameter h, and the second is the truncation lag parameter b for covariance weighting; which is parametrized as the ratio of the truncation lag ST to the sample size T: Both h and b can be xed or small in our asymptotic speci cations. Under the small-h speci cation, h ! 0 but hT ! 1: Similarly, under the small-b speci cation, b! 0 but bT !1: Under the xed-h and small-b speci cation, the asymptotic properties of the HAR variance estimator and the associated test statistic resemble what one would obtain in a parametric setting. Under the small-h and small-b speci cation, the asymptotic bias and variance of the HAR variance estimator are determined jointly by h and b: This is in contrast with the parametric setting where the bias and variance trade-o¤ is dictated by b only. Regardless of whether h is xed or small, the xed-b asymptotics delivers new limiting dis-
منابع مشابه
Nonparametric functional central limit theorem for time series regression with application to self-normalized confidence interval
This paper is concerned with the inference of nonparametric mean function in a time series context. The commonly used kernel smoothing estimate is asymptotically normal and the traditional inference procedure then consistently estimates the asymptotic variance function and relies upon normal approximation. Consistent estimation of the asymptotic variance function involves another level of nonpa...
متن کاملDiscrimination of time series based on kernel method
Classical methods in discrimination such as linear and quadratic do not have good efficiency in the case of nongaussian or nonlinear time series data. In nonparametric kernel discrimination in which the kernel estimators of likelihood functions are used instead of their real values has been shown to have good performance. The misclassification rate of kernel discrimination is usually less than ...
متن کاملImproved Testing Procedure for Kernel Methods with Time Series Data
This paper develops a robust testing procedure for the nonparametric kernel method in the presence of temporal dependence of unknown forms. We rst propose a new variance estimator that corrects the nite sample bias caused by temporal dependence. The new variance estimator is novel in the sense that it is not only robust to temporal dependence in nite samples but also consistent in large samp...
متن کاملAsymptotic Behaviors of Nearest Neighbor Kernel Density Estimator in Left-truncated Data
Kernel density estimators are the basic tools for density estimation in non-parametric statistics. The k-nearest neighbor kernel estimators represent a special form of kernel density estimators, in which the bandwidth is varied depending on the location of the sample points. In this paper, we initially introduce the k-nearest neighbor kernel density estimator in the random left-truncatio...
متن کاملAdaptive bandwidth selection in the long run covariance estimator of functional time series
In the analysis of functional time series an object which has seen increased use is the long run covariance function. It arises in several situations, including inference and dimension reduction techniques for high dimensional data, and new applications are being developed routinely. Given its relationship to the spectral density of finite dimensional time series, the long run covariance is nat...
متن کامل